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Abstract. In 2010, M. Studeny, R. Hemmecke, and Linder explored a new algebraic descrip- 
tion of graphical models, characteristic imsets. Compare with standard imsets, characteristic 
imsets have several advantages: they are still unique vector representative of conditional inde- 
pendence structures, they are 0-1 vectors, and they are more intuitive in terms of graphs than 
standard imsets. After defining characteristic imset polytope as the convex hull of all charac- 
teristic imsets for a given set of nodes, they also showed that a model selection in graphical 
models, which essentially is a problem of maximizing a quality criterion, can be converted into 
an integer programming problem on the characteristic imset polytope. However, this integer 
programming problem is very hard in general. Therefore, here we focus on diagnosis models 
which can be described by Bipartite graphs with a set of m nodes and a set of n nodes for any 
m,n G Z+, and their characteristic imset polytope. In this paper, first, we will show that the 
characteristic imsets for diagnosis models have very nice properties including that the number 
of non-zero coordinates is at most is n ■ (2 m — 1), and with these properties we are able to find a 
combinatorial description of all edges of the characteristic imset polytopes for diagnosis models. 
Then we prove that these characteristic imset polytopes are direct products of n many (2 m — 1) 
dimensional simplicies. Finally, we end the paper with further questions in this topic. 
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1. Introduction 

Bayesian networks (BNs), also known as belief networks, Bayes networks, Bayes(ian) models 
or probabilistic directed acyclic graphical models, find their applications to model knowledge 
in many areas, such as computational biology and bioinformatics (gene regulatory networks, 
protein structure, gene expression analysis [3] learning epistasis from GWAS data sets [I]) and 
medicine [13J. BNs are a part of the family of probabilistic graphical models (GMs). These 
graphical structures represent knowledge about probabilistic structures for a statistical model. 
More precisely, each node in the graph represents a random variable and an edge between the 
nodes represents probabilistic dependencies among the random variables corresponding to the 
nodes adjacent to the edge [BJ. BNs correspond to GM structure known as a directed acyclic 
graph (DAG) defined by the set of nodes (vertices) and the set of directed edges. 

In order to infer parameters from the observed data set, we first apply a model selection 
criterion called quality criterion, which provides a way to construct highly predictive BN models 
from data by choosing the graph which gives the given criteria, such as Bayesian Information 
Criteria (BIC) [8] or Akaike Information Criteria (AIC) pQ, maximum (see [10] for more details 
on quality criterions). Intuitively a quality criterion is a function, Q(G, D), which takes a DAG, 
G, and an observed data set, D, to evaluate how good the DAG G to explain the observed data 
D. Note that different DAGS, G\,G2 may have the same conditional independences (CIs). In 
that case we say G\,G% are Markov equivalent. When researchers wish to infer the CIs of the 
BN structure from the observed data set one represents each set of Markov equivalent graphs by 
one graph called the essential graph the corresponding Markov equivalence class of DAGs [2]. In 
this paper we focus on quality criterions Q(G,D), such that Q{G\,D) = Q(G2,D) if and only 
if G\, C?2 are Markov equivalent. 
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Since in general there are super exponentially many essential graphs with a fixed set of nodes 
N, maximizing the quality criterion, Q(G, D), over all possible essential graphs with N is known 
to be NP-hard. Studeny developed an algebraic representation of each essential graph G called 
a standard imset, of G, which is an integral vector representation of G in M 2|JV ' _ l-W|-i_ From 
the view of this setting a criterion function Q(G,D) is a dot product of vectors in IR 2 '^' - !^! -1 . 
In 2010, M. Studeny, J. Vomlel, and R. Hemmecke showed that maximizing the Q(G, D) over 
all essential graphs can be formulated as a linear programming problem over the convex hull 
of standard imsets for all possible essential graphs [12 1. This gives us a systematic way to 
find the best criterion with the optimality certificate rather than finding the best criterion by 
the brute-force search. Then M. Studeny, R. Hemmecke, and Linder explored an alternative 
vector representative of the BN structure, called characteristic imsets. Compare with standard 
imsets, characteristic imsets have several advantages: they are still unique vector representative 
of conditional independence structures; they are 0-1 vectors; and they are more intuitive in 
terms of graphs than standard imsets |llj . 

In general, however, the dimension of the convex hull of the characteristic imsets with the 
fixed set of nodes N, called characteristic imset polytope, is exponentially large and there are 
double exponentially many vertices as well as facets of the characteristic imset polytope. Thus it 
is infeasible to optimize by software if \N\ > 6. In order to solve the LP problem for a larger \N\, 
we need to understand the structure of the characteristic imset polytope, such as combinatorial 
description of edges and facets of the polytope so that we might be able to apply a simplex 
method to find an optimal solution. However, in general, it is challenging because there are too 
many facets and too many edges of the polytope. Therefore here we focus on a particular family 
of BN models, namely diagnosis models. 

In medical studies, researchers are often interested in probabilistic models in order for them 
to correctly diagnose a disease from a patient symptoms. The diagnoses models, also known as 
the Quick Medical Reference (QMR) diagnostic model, is introduced in [9] to diagnose a disease 
from a given set of symptoms of a patient. Therefore, here we focus on diagnosis models (e.g., 
[7])- Under this model, a DAG representing the model is a bipartite graph with two sets of 
nodes, one representing m diseases and one representing n symptoms, and set of directed edges 
from nodes representing diseases to nodes representing symptoms (see Figure [T] for an example) . 

In this paper, first, we will show that the dimension of the characteristic imset polytope for 
diagnosis models with m diseases and n symptoms is n ■ (2 m — 1) which is much smaller than 
2(m+n) _ ^ n _|_ m ~j _ ^ Second, we are able to find an explicit combinatorial description of all 
edges of the characteristic imset polytopes for diagnosis models with fixed m and n, that is, 
if G\, G2 are graphs representing two diagnosis models such that all symptoms have the same 
parents in Gi and in G2 except one symptom, then the characteristic imsets representing G±, G2 
form an edge of the characteristic imset polytope for diagnosis models. Then we prove that these 
characteristic imset polytopes are direct products of n many (2 m — 1) dimensional simplicies. 

This paper is organized as follows. In Section [2] we introduce notation and state some defi- 
nitions. Section [3] shows propositions and their proofs, and Section [4] shows our main results. 
Then we will show some examples in Section [5] and discuss our future work in Section [6) 

2. Notation and definitions 
In this section we state some notation and remind readers some definitions. 

Definition 2.1. A Diagnosis Model can be described by a Bipartite Graph whose nodes 
N = {ai, . . . , a m } U {bi, . . . , b n } can be divided into disjoint sets A = {ai, . . . , a m } and B = 
. . . , b n }. Nodes in A can be interpreted as diseases and nodes in B can be interpreted as 
symptoms. Every single edge can only be drawn from a disease to a symptom. An example is 
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given by Figure [T} 
Define notation: Q m n 
n}. 



{All possible Bipartite graphs defined in Definition 2.1 for fixed m and 




Figure 1. An example of Bipartite Graph, m = 3, n = 6. 



3. Propositions and Proofs 
Recall that we have the definition of Characteristic Imset. 

Definition 3.1. Let G be an acyclic directed graph over N. The characteristic imset for G 

can be introduced as a zero-one vector c G with components cc(S) where S C N, \S\ > 2 given 
by 

cg(S) = 1 <^=^ 3 i G S such that j G pa G (i) for V j G ^{i} 
where j G pa G (i) means G includes the edge from j to i. 

Proposition 3.2. Assume \N\ > 2 and G is a Bipartite graph. A = {a±, . . . ,a m } and B = 
. . . , b n } are defined in Definition 2.1 Then cg(T) is possible to take value 1 if and only if 
T has the form of . . . a,i k bj, where 1 < k < m, {ii, . . . , i^} Q {1, . . . , m} and j G {1, . . . , n}. 

Proof. Notice that for VTCiV, \T\ > 2, we can write T in the following form: 



(3.1) 



T = . . . ai k bj x . . .bj v where < k < m, . . . , ik} C {1, . . . , m}, 

0<Z<n,O 1) ...,j z }C{l,...,n}, 
k + I > 2. 



Now we need to prove that I can neither be nor greater than 1. 

(a) If I = 0. For V s, t G {ii, . . . , ik}, by Definition 2.1 there cannot be an edge from a s to 
at- This means a s ^ pao(at)- Hence V i S . . . ,ik}, T \ {a t } ^ pac(at)- cg(T) = 0. 

(b) If Z > 1. Similar with above, by Definition 2.1, for V s',t' G {ji, . . . , ji}, b s > ^ pacibf). 



Moreover, for V t G {h, . . . ,i k } and t' G {ji, . . . ,ji}, b t , pa G (a t ). c G (T) = 0. 



□ 



Proposition 3.3. Notation same as Proposition 3.2, Suppose T has the form of . . . ai k bj, 



where 1 < k < m, {ii, . . . , ik} C {1, . . . , m} and j G {1, . . . , n}, then cq{T) 



Proof. As shown in part (a) in the proof of Proposition 3.2 , for V s, t G {i±, 
Therefore: 



,ik}, a s i pa G (a t ). 



c G (T) = 1 if and only if {a h . . . a ik } C pa G (bj); 
If and only if a s G pa G (bj), for V s = i\, . . . , ik] 
If and only if c G {a s bj) = 1, for V s = i\, . . . , ik- 

Recall that c G (T) is binary. Thus c G (T) = Y\ s=il ifc c G (a s bj). □ 



1 



JING XI AND RURIKO YOSHIDA 



Remark 3.4. Another way to see Proposition 3.3 is that for a diagnosis model, the whole 
characteristic imset is determined by only m ■ n coordinates, cciciibj), i G {l,...,m}, j G 
{1, . . . , n}, which can also be interpreted as the existence of edge point from a% to bj. From this 
remark, it is straightforward to get Proposition 3.6 For a related example, see Example 1 5. 1| 



Remark 3.5. Proposition 3.3 also implies that V G G G m ,n, G can be determined by pac{bj) 
bj G B, and pac(bj), bj G B are completely irrelevant. 



Proposition 3.6. Notation same as Definition \2.1\ Fix m and n. The number of elements in 

Proof. Consider a bipartite graph G G Q m ,n- At most m ■ n possible edges can be assigned from 
nodes in a set A to nodes in a set B. Note that there are 



E 



mn 
k 



k=0 

many ways to assign edges from nodes in a set A to nodes in a set B. □ 



Proposition 3.7. Notation same as Definition 2.1. Fix m and n. If we consider the charac- 
teristic imset for an arbitrary Bipartite graph Vfi Qui ii ? the fiiimbcT of tloti-zgto coovdindtcs is at 
mostn- (2 m - 1). 

Proof. Consider T by classify its cardinality. 



When 2 < |T| < m + 1, by Proposition 3.2, the coordinate related to T can be 1 if and 
only if T has the form of . . . a%, T ,_ x bj^ {h, . . . , i\r\-i} Q {1 3 • • ■ j m } an d j G {1, . . . , n}. 

Fix 71. (he number of this kind of T is: ( .^-.r^ 1 ■ //. 



• When \T\ > m + 1,3 b h , b h G {1, . . . , n} s.t. b h , b h G T. By Proposition [3^2} c G (T) = 0. 
Therefore, the number of possible non-zero coordinates is: 

m+l 

£ (pv-i )■"="■£( * )=» p'"- 1 )- 

|T|=2 V 7 fc=l 



□ 



Remark 3.8. 



• Notation same as Definition 2.1 For a fixed TV, by Proposition 3.2 and 3.7, we can define 
S m: n as the support of {c G '■ G G Q m ,n}- We know that: 

{T : 3 G G g m , n such that c G (T) = 1} C P(/V) 

where V(N) is the power set of N. 

• Recall that in elementary geometry, 

— a closed convex polyhedron (which will be indicated as polyhedron for short) 
in M q can be defined by a system of linear inequalities: 

{x G R q : Ax. < b} 
where A is a p x q matrix in W xq and b is a p x 1 vector in M pxl ; 

— a closed convex polytope (which will be indicated as polytope for short) is 
defined as the convex hull of a finite set of points; 

— if a polyhedron is bounded, then it is a polytope. 

• A d-simplex is a d-dimensional polytope which has exactly d + 1 vertices. It is notated 
as A d . 

• For more details on polyhedral geometry see 
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Notice that Proposition 3.7 also implies: for fixed m and n, the dimension of the poly- 
tope of characteristic imsets for all elements in Q m ,n (we call it characteristic imset 
polytope in the following and use P mjn as the notation) is at most n ■ (2 m — 1). We 
will prove that it is actually exactly n ■ (2 m — 1). 

Because characteristic imsets are all 0-1 vectors, it is obvious that for fixed m and n, the 
set of vertices of characteristic imset polytope is exactly {cq '■ G G G m ,n}- 



4. Theorems and Proofs 



Theorem 4.1. Notation same as Definition 2.1, Fix m and n. The dimension of the charac- 
teristic imset polytope is exactly n ■ (2 m — 1). 

Proof. For an N, we can consider the standard basis ex, T C N, which have the same coordi- 
nates with characteristic imsets and ex (To) = 1 if and only if To = T, V To C N. 
It is obvious that l.){c G , G G G m ,n} C R 2m+n ~( m + n + 2.){e T ,T G 5 m ,„} is a basis of M n '( 2m_1 ) 
which is embedded in R2™+»-(m+n+l) and 3 .){ CG) G G g m>n y 

can be written as a linear com- 
bination of {ex,T G 5 m ,n}- We are going to prove that {ex,T G S m ,n} can be expressed as a 
linear combination of {cq, G G Q m ,n}- 

Notice that {ex,T G S mtn } is equivalent with {ex,T C iV and T has the form of . . .ai k bj, 
where 1 < k < m, . . . , ik} Q {1, • • • , m} and j G {1, . . . , n}}, we can prove the theorem by 
induction. 

• When k = 1, i.e. T = aibj where a« G A and bj G B, we can choose graph G which has 
only one edge (aibj). Then cc = e T - 

• Consider T^ = a^.-.a^bj where 1 < k < m, {ii,.--,ik} Q {!)••• -.m} and j G 
{l,...,n}. Suppose for V T, |T| < k, ex can be written as a linear combination of 
{cg, G G G m ,n}- 

Let G be the Bipartite graph with k edges: (a^bj), I = 1 . . . k. Then: 
e Tk = c G - e T a u{bj} 

T a C{ai lt -,<H k },0<\T a \<k 

By assumption, ex a ubj can be expressed as a linear combination of {cq, G G G m ,n\i f° r 
V T a C {ajj , . . . , aj fe }, < |T a | < fc, therefore, ex k can be written as a linear combination 
of {c G , G G £ m ,„}. 

□ 



Remark 4.2. Notice that for the special case n = 1, Theorem |4.1| and Proposition 3.6 claim 
that P m i has 2 m vertices and the dimension of P mjn is n • (2 m — 1). This directly lead to 
Theorem WM 



Theorem 4.3. Fix m, P m i is a simplex with dimension 2 m — 1, i.e. P TO l = &2 m -i- 
4.1. Combinatorial Description of Edges on P TO) „. 

Definition 4.4. Graphs G, H G <5 m ,n are called neighbors if cg and ch form an edge in the 
characteristic imset polytope. 



Lemma 4.5. Notation same as Definition 2.1. Fix m, then for arbitrary two distinct graphs, 
G\, (?2 G £7 m ,lj G*i and G2 are neighbors, i.e. cq 1 and cg 2 form an edge in the characteristic 
imset polytope. 
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Proof. Let N = {a±, . . . , a m , bi}. We are going to prove 3 a cost vector w, such that w ■ c Gl 
w ■ cg 2 > vv ' cg 3 for V G3 £ Qm,n distinct with G\ and G 2 . 



By Remark 3.5, G\ and G2 are determined by pa Gl (&i) and paG 2 {b\), respectively. Let's discuss 
by two scenarios of pa Gl (&i) and paG 2 {b\). 

(1) One is a subset of the other. WLOG, suppose pa Gl (&i) C paG 2 {b\). 

Define notations: A\ = pa Gl {b\), A 2 = pa G2 {b\), A 2 \\ = paG 2 {h)\pa Gl (b 1 ), and 
Acomp = [paG 2 {b\)) c . Notice that A 2 \\ 7^ 0, A\ and A comp can be 0. Ai, A 2 u and 
^4comp is a partition of N. 

Consider a cost vector w whose coordinates match the coordinates of characteristic im- 
sets. 



— If |^2\il > 1) define w as: 
w(T) = 



c 


for 


T 


—c 


for 


T 


\A 2 \l\ 


• c for 


T 





for 


T 



aibj, ai G A\ 
aibj, ai ^ A\ 
A 2 \i U {61} 
T C N, \T\ > 2, and T / A 2 \i U {61} 

where c is a positive number. 
Then for V G 3 G Qm.ni we have: 

^•CGs = l^i npa G3 (6i)| ■ c- |pog 3 (6i)\Ai| • c+ IA^jJ • c- cg 3 (^ 2 \i u { & i}) 

= \A\ npa G3 (6i)| • c - |pog 3 (6i) n ^ 2 \il • c - |pa G3 (bi) n vl CO m P | • c+ |^ 2 \il • c ■ c G3 (^ 2 \i U {61}) 

Now notice that in the above equation: 

* \A\ r\paG 3 (bi)\ ■ c < \Ai\ ■ c, where "=" holds if and only if A\ C pa Ga (6i); 

* — \paG 3 (b\) HA 2 \i \ • c + \ A 2 \i\ ■ c- cg 3 (A 2 \ 1 U {61}) < 0, where "=" holds if and 
only if pa Gs (bi) n ^4 2 \i = or A 2 \i', 

* — \paG 3 (bi)r\A comp \-c < 0, where "=" holds if and only if paG 3 (bi)r\A comp = 0. 
Therefore, w ■ c Ga < \A\ \ ■ c where "=" holds if and only if G3 = G\ or G 2 . 

— If \A 2 \i\ = 1, let A 2 \i = {a q }, define w as: 



w(T) 



c for T = aibj, ai G Ai 

—c for T = aibj, ai £ A 2 

for T = a q b\ 

for T CN, \T\ > 2, and T / A 2 \ x U {61} 



where c is a positive number. 
Then for V G 3 G Gm,n, we have: 

w • c G3 = I npa Gs (6i)| • c - |pa Gs (6i) n ^ CO m P | • c 

Now again notice that in the above equation: 

* \Ai npa G3 (6i)| • c < |^4.i I • c, where "=" holds if and only if A\ C paG 3 {b\); 

* —\paG 3 (bi)(lA comp \-c < 0, where "=" holds if and only if paG 3 (bi)nA comp = 0. 
To satisfy the above two conditions, we must have paG 3 {b\) = A\ or (Ai U a q ). 
Therefore, again we have: w ■ c Ga < \Ax\ • c where "=" holds if and only if G3 = G\ 
or G 2 . 

(2) Neither one is a subset of the other. 

Define notations: A\ = pa Gl {b\), A 2 = pa G2 (h), A ln2 = pa Gl (h) npa G2 (h), A x \ 2 = 
pa Gl {bi)\pa G2 {bi), A 2 \i = pa G2 {bi)\pa Gl {bi) , ^iu2 = pa Gl (h) U pa G2 (h) and A comp = 
{Aiu2) c - Notice that A x \ 2 , A 2 \i / 0, A in2 and A comp can be 0. A ln2 , A x \ 2 , A 2 \i, and 
A CO mp is a partition of N. 
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Consider a cost vector w whose coordinates match the coordinates of characteristic im- 
sets. 



c 


for 


T = aibj, ai £ Ai n2 


— c 


for 


T = aibj, ai £ Ai n2 


-2c 


for 


T = A A2 U A 2V U {61} 


(L4i\ 2 | + l)-c 


for 


T = A A2 U{b 1 } 


(|^ 2 \i| + l)-c 


for 


T = A 2V U {h} 





for 


other T C N, \T\ > 2 



— If \Ai\ 2 \ > 1 and |-A 2 \i| > 1- Define w as: 



w(T) 



where c is a positive number. 
Now for V G3 G Q m ,ni we have: 

w-c G3 = \paa 3 (61 ) n A ln2 1 • c - \pa G[i (61) n A x \ 2 1 ■ c 

-|paG 3 (6i) n A 2 \i| • c - |paG 3 (6i) n .4 comp | • c 

+(|^i\ 2 | + 1) • c • cg s (A iN2 U {h}) + (|A 2U | + 1) • c • c G3 (A 2V U {61}) 
-2c-c G3 (A A2 UA 2 \ 1 U{6i}) 
= \po-G 3 (bi) n Ai n2 | • c 

-|pa Gs (6i) n A A2 | • c + (\A A2 \ + 1) • c • cg s (A iN2 U {61}) 
-|pa Gs (6i) n A 2V \ ■ c + (\A 2V \ + 1) • c • c G3 (A 2V U {61}) 
-2c-c G3 (A A2 UA 2 \ 1 U{6i}) 



■|pa Gs (6i) n A, 



comp 



■ C 



Now notice that in the above equation: 

* \pac 3 (bi)n A\ n2 1 • c < I Ai n2 1 • c where " =" holds if and only if Ai n2 C pa G . 3 (£>i ) ; 

* -|pog 3 (6i) n Ai\ 2 \ ■ c+ (|^4i\ 2 | + 1) • c • c G3 (A 1 \ 2 U {61}) < c where "=" holds 
if and only if A^ 2 C pa G . 3 {bi); 

* -\paG 3 (bi) n A 2 \i| • c+ (|A 2U | + 1) • c • c G3 (A Al U {61}) < c where "=" holds 
if and only if A 2 \i C pa Ga (6i); 

* -2c-c G3 (A 1 \ 2 U^ 2 \iU{6i}) < where "=" holds if and only if (A ± \ 2 U A 2 \ ± ) (t 
paG 3 (bi); 

* — \paG 3 (bi)r\A comp \ -c < where "=" holds if and only if pa G3 (bi)nA comp = 0. 
The above conditions cannot be satisfied simultaneously, but after we figure out 
that: 

* when pa G . 3 (b 1 ) = A ln2 , w ■ c G . 3 = \A ln2 \ - c + + + + = \A ln2 \ ■ c; 

* when pa Gz (b\) = A\, i.e. G3 = G\, w ■ c G . 3 = \A\ n2 \ - c + c + + + = 
(|Am2| + l)-c; 

* when pa G . 3 (bi) = A 2 , i.e. G3 = G 2 , w ■ c G . 3 = \A\ra\ - c + O + c + + = 
(|Am2| + l)-c; 

* when pa G3 (h) = Ai u2 , w ■ cq 3 = \Ai n2 \ - c + c + c-2c + = |Ai n2 | • c, 

it becomes obvious that: w • c G3 < (|Ai n2 | + 1) • c where "=" holds if and only if 
G 3 = Gi or G 2 . 

If only one of |A 1 ^ 2 | and \A 2 \\\ is 1. Suppose |-A]\ 2 | = 1 and |^4 2 \il > 1- Define w as: 



w(T) = < 



c for T = a^j, ai 6 A\ 

— c for T = a^j, ai ^ A\ 

-2c for T = A x \ 2 U A 2 \i U {61} 

(l^2\i| + 1) ■ c for T = A 2 \ l U{b 1 } 

for other T C N, \T\ > 2 
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where c is a positive number. 
Now for V G3 G Q m ,ni we have: 

w-c G3 = |poG 3 (fei)n^i n2 | • c+ |pa G g(6i) nAx\ 2 \ " c 

-|pa G3 (6i) n A 2 \i\ ■ c - \pa G3 (bi) n A comp | ■ c 
+(|A 2 \i| + 1) • c • c G3 (A 2V U {bi}) - 2c • c G 3(^ lV2 U A 2V U {61}) 
= \paG 3 (bi) n A ln2 | • c 
+ba<3 3 (61 ) n^!\ 2 1 • c 

-|pa Gs (&i) n A 2 \i| • c + (|A 2 \i| + 1) • c • c G3 (^ 2Vl U {61}) 
-2c-cg s (-Ai\2UA 2U U{6i}) 
-|pa G3 (6i) n 
Now notice that in the above equation: 

* \pac 3 (b\ ) H Ai n2 1 • c < I Ai n2 1 • c where " =" holds if and only if Ai n2 C pac 3 (bi ) ; 

* \pO'G 3 {bi) fl Ai\ 2 | • c < c where "=" holds if and only if A^ 2 C pa G3 (6i); 

* -|pog 3 (6i) n A 2 \i| • c+ (|A 2 \i| + 1) • c • c G3 (A 2 \! U {61}) < c where "=" holds 
if and only if A 2 \i C pa Ga (bi); 

* -2c-c G3 (A 1 \ 2 Uyl2\iU{6i}) < where "=" holds if and only if (A^UA^x) 
paG 3 (bi); 

* — \paG 3 (bi)r\A comp \ c < where "=" holds if and only if paG 3 (bi)r\A comp = 0. 
The above conditions cannot be satisfied simultaneously, but similar with first case 
in (2), it can be shown that: w ■ c Ga < (|Ai n2 | + 1) • c where "=" holds if and only 
if G 3 = Gi or G 2 . 

— If I An 2 1 = |A 2 \i| = 1. Define w as: 



w(T) 



c for T = aibj, di G Ai u2 

-c for T = aibj, ai g Ai u2 

-2c for T = A 1 \ 2 UA 2 \ l U{h} 

for other T C N, \T\ > 2 



where c is a positive number. 
Now for V G3 £ Q m ,ni we have: 

w-c G3 = \pdG 3 (bi) n A ln2 | • c 

+\pa Gs {bi) D A^ 2 \ -c+ |pa Ga (&i) n A 2 \i| • c 
-2c-c G3 (A 1 \ 2 UA 2 \ 1 U{6i}) 

-\paG 3 (h) n A comp | • c 

Now notice that in the above equation: 

* \po-G 3 (fri) H Ai n2 1 • c < I Ai n2 1 • c where " =" holds if and only if Ai n2 C pa G3 (61 ) ; 

* IpuGaibi) n An 2 | • c < c where "=" holds if and only if An 2 C pa Gs (6i); 

* |pa Gs (6i) fl j4 2 \iI • c < c where "=" holds if and only if A%\i C pa Ga (6i); 

* -2c-c G3 (A!\ 2 UA 2 \iU{6i}) < where "=" holds if and only if (A 1 \ 2 UA 2 \i) 
pa G3 (bi); 

* — \pa,G 3 (bi)riA corn p\ c < where "=" holds if and only if paG 3 (bi)r\A comp = 0. 
The above conditions cannot be satisfied simultaneously, but similar with first case 
in (2), it can be shown that: w ■ c Ga < (|Am 2 | + 1) • c where "=" holds if and only 
if G 3 = G x or G 2 . 

□ 



4.6. We can understand Lemma 4.5 better after we introduce a new concept "tuft". 
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• A structure is called k-tuft if it in- 
cludes (k + 1) nodes such that there 
exists one node (the offspring) that 
all other nodes are its parents. There 
are two important special cases: an 1- 
tuft is just an edge and a 2-tuft is just 
an immorality. An example of 6-tuft is 
given in the right hand side graph. 




A 6-Tuft Structure 



Fix m and n. We say a change from one graph G\ to another one G2 is swapping 
turves if: G2 can be obtained from G\ by removing a tuft and adding another tuft, 
where these two turves share the same offspring but disjoint parents. An example is 
given as below: G2 can be obtained from G\ by removing a 2-tuft and add an 1-tuft, 
where the two turves share the same offspring 62- 




Graph Gi 



Graph G2 



Lemma 4.5 gives us a principal idea of finding neighbors for a graph: add or remove a 



tuft, or swap turves once. We are going to prove this in general case in Theorem 4.7 



Theorem 4.7. Notation same as Definition \2. 11 Fix m and n. Two graphs, G\, G2 £ Qm,n 
are neighbors if and only if3bi £ B such that pa Gl (bi) 7^ pa G2 (bi) and pa Gl (bj) = pa G2 (bj), for 
V bj £ B and bj 7^ bi, i.e., G2 can be obtained from G\ by removing or adding a tuft or swapping 
tufts. 

Proof. We will prove "if and "only if separately. 
(1) Prove "if part. 

Suppose we have two graphs, G\, G2 £ G m ,n, and 3 bj £ B such that pa Gl (bi) 7^ pa G2 (bi) 
and pa Gl (bj) = pa G2 (bj), for V bj G B and bj 7^ 6j. We need to prove G\ and G2 are 
neighbors. 

Consider an arbitrary graph G3 G Q m ,n- We want to prove that 3 a cost vector w such 
that w ■ cg x = w ■ cq 2 > w ■ cq 3 where "=" holds if and only if G3 = G\ or (?2- 



Define the following graphs (see Remark 4.8 for example): 

★ G[, G' 2 , G' 3 £ Qm,\ with symptom B nh i = {bi} such that paQ> (pi) = paa 1 (bi 
pacdh) =pa G2 (bi) andpa G /(6,) =pa G3 {h)\ 



•k Go, Go e Q, 



m,(n— 1) 



l) with symptoms B, 



m,(n— 1) 



B\{bi} such that paa (bj) 



pa Gl (bj) = pa G2 (bj) and pa G »(bj) = pa G3 (bj), V bj £ 5 m , („_!). 
Notice that the connections of the characteristic imsets of these graphs are very simple. 



By Remark 3.5, after moving the coordinates properly, we can write the characteristic 
imsets of G\ , G2 and G3 in the form of: 



CfJ 2 
C G 3 



(cg[ c Go ) 
(c& 2 CGo) 
(cg' 3 c g ») 
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— As proved in Lemma 4.5 G\ and G' 2 are neighbors. This means that 3 related cost 
vector u>i such that w\ ■ c G > = w\ ■ c G i > wi ■ c G > for V G' 3 G Q m ,li where "=" holds 
if and only if G' 3 = G[ or G' 2 . 

— Because c Go is a vertex of the characteristic imset polytope related to G m ,(n-i)i we 
can find a related cost vector W2 such that W2 • cg > W2 • c G " for V G 3 G Q m ,(n-X)i 
where "=" holds if and only if G' 3 ' = G . 

Now let w = (w\ W2) with the new permutation of coordinates. We have: 

w ■ c Gl = w\- c G > x + W2 ■ c Go 

= Wi ■ C G > 2 +W 2 - C Go = w ■ °G 2 
> Wl ■ C G > 3 + W 2 ■ C G 'l = W -C G3 

where "=" holds if and only if i) G' 3 = G[ or G' 2 , and ii) G' 3 ' = G , i.e. G 3 = Gi or G 2 . 
(2) Prove "only if part. 

Suppose we have two graphs, Gi, G2 G Q m ,n, which are neighbors, i.e. 3 a cost vector w 
such that w ■ cg 1 = w ■ cg 2 > w ■ cg for V G G Q m>n , G 7^ Gi, G2. We are going to prove 
by contradiction. 

Suppose 3 bi, bj G B distinct, paG^bi) ^ paG 2 (bi) and po-Gi^j) P a G 2 ^j)- 
Define the following graphs (see Remark |4.8| for example): 

* G[, G' 2 G Gm,i with symptom B m ^\ = {bi} such that paG'^i) = P^dih) and 
pa G ' 2 (k) =pa G2 {bi); 

* G'l, G'2 G Q m ,\ with symptom B m> i = {bj} such that pdG'{(bj) = pa Gl (bj) and 
Pacfoj) =paG 2 (bj); 

* G'l', G'2' G G m ,(n-2) with symptoms -B m ,(n-2) = B\{bi,bj} such that pa G '('{b k ) = 
pa Gl {b k ) and pa G »>(b k ) = pa G2 {b k ), V b k G 5 m ,( n _ 2 ); 

* G3 G Qm.n is all same with Gi but pa,G 3 (bi) = pa G2 (b{); 

* G4 G £/ m , n is all same with Gi but pa G4 (bj) = pa G2 (bj); 

* G5 G QuiyTi is all same with G 2 but pa G5 (bi) = pa Gl (bi) and pa G5 (bj) = pa Gl (bj), 
notice that G5 might be same with G\. 

Similar with (1), after moving the coordinates properly, we can write the characteristic 
imsets of Gi, G2, G3, G4 and G5 in the following form: 

CGi = ( c G'j C G » C G '») 

cg 2 = {c G ' 2 c G » c G »/) 

C G 3 = {c G ' 2 c G » C G '») 

cg 4 = {cg[ c g » c g /») 

c g 5 = {cg[ c g » c g »') 

Now use the same permutation of coordinates in w and do the same partition, we can 
write uu in the form of w = (u>i W2 w 3 ). By the assumption we indicated at the beginning 
of this part, we have: 

* It is obvious that G3 7^ G\ or G2. So: 

w-c Gl = wi- c G > x + w 2 ■ c G >( + w 3 ■ Cqw 

> W-C Gs = W 1 • C G ' 2 + 102 • C G » + W 3 • C G '» 

=> m ■ c G / > Wi • C G / 

* It is obvious that G4 / Gi or G2. So: 

w • c Gl = wi - c G ' x + W2 ■ C G n + • C G '» 

> W • C G4 = Wl • Cc^ + W2 ■ C G n + W3 ■ C G '(< 
=^ W2 • C G » > W2 • C G » 
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Then we have: 



W ■ C G2 = Wl- C G / + W 2 ■ C G » + W 3 ■ C G rn 

< Wl ■ C-G\ + w 2 ■ CG" + W3 ■ C G m = W ■ C Gb 

=>• w-c G2 < w-c Gs 

which is a contradiction with our assumption. Therefore G\ and G2 cannot be neighbors. 

□ 



Remark 4.8. In the two parts of proof of Theorem 4.7 it might be more intuitive to see how 
we define the new graphs using examples. 

• Part (1), prove "if condition. Let m = 4 and n = 3. In the example below, 6j = b\ . 




S2) ( a3l (a4) 



Graph Gi' 



Graph G2' 



Part (2), prove "only if condition. Let m = 4 and n = 3. In the example below, hi = b\ 
and bj = bi- 



an (a2) ( a3) (a4) 





Graph Gi' 
a- 1 ) (a7) (a2) (ai) (a--) 



Graph Gi" Graph Gi 

a .1 1 (ai) (a2) 0) 





Graph G2 -X ~* Graph G2' 

0) (^2) (a^) (^) (a?) (0 



Graph G2" 



Graph G2 
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Theorem 4.9. Notation same as Definition 2.1, Fixm andn. ForV G G Q m ,n, G has n-(2 r 
many neighbors. 



Proof. By Theorem 4.7 for V H G £7 m) n, G and if are neighbors if and only if: 3 bi G -B such 
that pac(bi) ^ pan (pi) and pac{bj) = pajf(bj), for V bj £ B and bj 7^ 6j. 
For a fixed 6j, define graphs: 

★ G', -ff 7 € <5 m) i with symptom B m) i = {bi} such that pac'ibi) = paoih) and pajj'(bi) = 
pa H {bi); 

★ G", if" G Q m i n -i) with symptoms B m r n _i\ = B\{bi} such that paG"{bj) = pac(bj) and 
pa H »(bj) = pa H (bj), V G 

Now suppose G and -ff are neighbors and G' 7^ iT', then G" = H" . By Proposition 3.6, there 
are 2 m graphs in Q m ,i- This means that there are 2 m — 1 possible different H's, and each one 
relate to a different neighbor of G. 

We can use the same strategy for each symptom, i.e. we can find 2 m — 1 neighbors for each 
b{ G B. It is easy to see that these neighbors are all distinct: 

★ H%, H2 are all same with G but pac{bi) 7^ pan 1 (bi) and pac(bj) / paH 2 (bj), where bi, 
bj G B are distinct. Then paH 2 (bi) = pao{bi) 7^ P^Hiibi), so Hi and H2 are different. 

Therefore the total number of neighbors for G is: n • (2 m — 1). □ 

Remark 4.10. Recall that we proved in Theorem |4.1[ for fixed m and n, the dimension of 
the characteristic imset polytope is n ■ (2 m — 1). Thus Theorem 4.9 implies that the number 
of neighbors for each vertex equals to the dimension, i.e. the characteristic imset polytope for 
Bipartite graphs is a simple polytope. In 2000, V. Kaibel and M. Wolff proved that a zero-one 
polytope is simple if and only if it equals to a direct product of zero-one simplices [5]. But here, 
we are going to prove a even stronger result. 



Theorem 4.11. Notation same with Definition 2.1, P m ,n is the direct product of n many 
A 2 m_ 1; i.e. 



Pm,n — A2'"-l X A2171-I X • • • X A2"_l 



n many 

And the %th simplex is P m ,i with the same diseases A and one symptom {bi}. 
Proof. Fix m, we are going to prove the equality holds using induction on n. 



n = 1. Proved in Theorem 4.3 



Fix q G Z + . Suppose the equality holds for P m ,n, V n < q, and we need to prove 
that it also holds for P m ,q- Recall that for Q m ,q, w e have notation for all symptoms: 
B = {h,h,...,b q }. 

First, we want to prove: P m ,n ^ P m ,g-i x P m ,i- 

Similar with the proof of Theorem 4.7, for V G G Q m ,ni we can define graphs: 

* G' £ G m ,(q-i) with symptoms B m r q _]\ = B\{b q } such that pacih) = pacih), 
V h G B m ^ q _ 1 y This implies c G , G "m,q— 1 j 

★ G" G Qm,i with symptom B m ^i = {b q } such that pac>'(b q ) = pac(b q ). This implies 
c-G" £ Pm,l- 

Again, with a proper permutation of coordinates, we can write cq in form of: 

cg = (cg> CQ")- 

Now because the set of vertices of P m ,q is {cg ■ G G Q m ,q}, so for V x G P m ,<j) with the 
same permutation of coordinates, we have: 

x= a G c G = ( Yl ct G c G > , Yl o> G c G »), 

G E Qrn ,q Gdz Qm ,q Qm- , q 
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where < a G < 1, V G € Q m ,q and E a G = 1- 

G&g 

Notice that a G^G f £ P m ,q-i and a G£G" £ P m ,i) the above equality implies 

x G P m , r i x P m> i. Hence: 

Second, we want to prove: P mj9 _i x P m> i C P mj „. 

Let Q m ,q-i has symptoms -B m ,( g _i) = and £/ mil has symptom -B mj i = For 

V G' G Q m ,{q-i) an d G" G & m ,i, we can define G G <? m)(J such that pa G {bi) = pacih), 

V b{ G B m r q _i\, and pa G {b q ) = pa G >i{b q ). We can write cg in form of c G = (c G i cc)- 
Now for V x G P m ,<j-i x P m ,l, by definition of direct product, it can be written as: 

x = ( E Pg'Cg 1 , E 1G"CG") = E E /3g'7g"( c c , c G ») 
G'eg m>g _ a G"eg m ,i G'eg m , q _i G"eg m ,i 

= E E (Pghg") c g , 

G'eS m , q _i G"eg m ,i 

where < P G >, <y G » < 1, V G' G Gm,q-i, G" G £/ m ,i, and EG'ee m , g _i = 1 ' 
SG»eg m ,jG" = 1- 
Notice that 

E ^E (Pg<1g») = E M E 7G")= E /5g' = 1- 
G'e6 m , ? _i G"e5 m ,i G'e5 m , ? _i G"e6 m ,i G'eS m ,g-i 

This leads to x G P m ,g- Hence: 

Pm,g— 1 X P m i C P mrl . 

Now using the assumption we made before, we can finish the proof because: 

Pm,g = Pm,<j-1 x P m A = A2™-1 X • • • X f±2™-\ X A2">_i = A^m-l X • • •xAyM . 

q-1 many Q many 

□ 

4.2. Facets of P mj „. 

Recall that there is a very famous theorem p3] about facets for direct product of simplices. 

Lemma 4.12. [2] Denote P as £/ie direct product of simplices A ai , . . . , A a(i . . TTien every facet 
o/P /ias £/ie /orm of A ai x . . . x A aj _ 1 x F Qi x A ai+1 x . . . x A Qfe , where F ai is a facet of A ai . 

Remark 4.13. This Lemma implies that if we want to study the facets of a direct product 
of simplies, we can simplify the problem as studying facets of each simplex. Now because we 



already proved in Theorem 4.11 that P mn is a direct product of n many P m i, our problem is 
simplified as studying the facets of P m i. Therefore, in the following part of this subsection we 
will assume B = {b\\. 



Assume A = {a\, . . . ,a m } and B = {b{\. As shown in Section [3j the vertices of P m ,i has 
at most 2 m — 1 many non-zero coordinates. We can remark the coordinates as: x = {x s , s C 
A, s 7^ 0}, here each x s corresponds to one coordinate in the characteristic imset c G {s U {b\}). 
Now suppose A m x < b m is the system of inequalities that defines P m ,i, and we can define a 
2 m x 2 m matrix: D m = [b m \ — A m ]. We can denote the elements in D m as (d s t) s cA,tcA such 
that we can rewrite the system of inequalities as: d s $ + EtcA d s tXt > 0, s C i. Then we 
have the expression of 2 m facets of ~P m ,l as following: 

F a = Pm,l n {x : d s + ^2 d st x t = 0}, s C A, 
where the elements d s t, s, t C A can be obtained using the following theorem. 
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Theorem 4.14. The elements in matrix D m can be computed as following: 
• d s t 7^ if and only if s C t; 

This implies that P mj i /ias 2 m facets: 



F„ 



m,l 



n {x : dgQ, + = 0}, s C A. 

tCA,i^0 



What's more, for V sCi, a// vertices o/P TO) i are on F s on/?/ except cg s where paG s {b\) = s. 

Proof. Let ccq = 1. For V s C A, let d s . be the row of D m which has index s, and G s be the 
graph in P m ,i for which paG s {b\) = s. Now we can rewrite the system of inequalities as: 

dstxt = 4(1 x) T > 0, /or VsU 

tCA 

We are going to prove that for V s C i, we can find 2 m — 1 vertices on F s which are linearly 
independent, and this imlies that F s is a facet of P m ,i. Actually we will prove that: {cg 3 ,, s' C 
A, s' / s} C ^ and c Gs ^ F s , i.e. c Gs ,) T = 0, Vs'Ci, s' / s and d s .(l c Gs ) T > 0. 

Notice that for V t C A, cg , (i U {6i}) 7^ if and only if t C pa c<3 (61) = s ', and d s t ^ if and 
only if s C i. Now we have: 



cg s ,) T = 40 + X] d stCG s ,{t U {61}) = d s + ^ d s4 = d 

tCA, t^0 sCtCs', t^0 sCtCs 



si- 



• If s = s', then d s . (1 cg s , ) t 

• If s C s', then cc s ,) r 

• If s s', then 4.(1 c Gs ,) r 



4 S = 1 > 0; 
£ (-1)1*1-1-1 = E (_l)|f| =0 ; 



0. 



t'Cs'\s 



□ 



5. Examples 



Example 5.1 (Only One Disease). For m = 1, by Proposition 3.2, only cg(oiOj), € {1, • • • , n}, 
can be 1. So consider all combination of existence of edge (a\bj), bj £ {1, . . . , n}, it is easy to see 
that the polytope of all possible characteristic imset is the n-dimensional unit hypercube. 
A simple example of n = 3 is given here. 




© © © 
Graph G13 




vertices 
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We can list all possible characteristic imsets in a table. 





T 


ai&i 


a\b 2 


ai& 3 


bib 2 


6263 


6163 


ai6i6 2 


aib 2 b 3 


aihb 3 


616263 


ai 61 6 2 63 


( CG ° \ 




( ° 





























\ 


CGi 




1 
































CG 2 







1 





























CG 3 










1 


























C G 12 




1 


1 





























C G*23 







1 


1 


























C G 13 




1 





1 


























V C Gl23 / 




V 1 


1 


1 























/ 



Example 5.2 (m = 2, n 

characteristic imsets. An example for encoding the subscript is given by Figure [2j 



2). Use Proportion 3.3, we can easily figure out all possible 




Graph G124 



Figure 2. Encode edges ai6i as 1, 0162 as 2, 0261 as 3, 0262 as 4. Graph G124 
is shown as above. 





T 


a\l 2 




ai6 2 


o 2 61 


a 2 b 2 


6162 


010261 


a\a 2 b 2 


016162 


a 2 b\b 2 


a\a 2 b\b 2 


/ C Go \ 




( ° 





























° \ 


CGi 







1 





























CG 2 










1 


























CG 3 













1 























CG 4 
















1 




















CG12 







1 


1 


























C Gl 3 







1 





1 








1 














C G 14 







1 








1 




















C G23 










1 


1 























C G24 










1 





1 








1 











C G34 













1 


1 




















C Gi23 







1 


1 


1 








1 














C Gi34 







1 





1 


1 





1 














C Gi 2 4 







1 


1 





1 








1 











C G234 










1 


1 


1 








1 











V C Gl234 1 


\ 


1 


1 


1 


1 





1 


1 








/ 



Example 5.3 (Facets of a simple example). Fix m = 2 and n = 1. 



16 



JING XI AND RURIKO YOSHIDA 




The matrix D 2 = [&2I — M\- 



Do = 



The system of inequalities which defines P2,i 



s\t 





ai 


d2 


aia 2 







-1 


-1 


1 \ 


ai 





1 





-1 


a 2 








1 


-1 


aia 2 










1 / 







T 




a 2 6i 


aia 2 6i 




s\t 





ai 


a 2 


aia 2 










( 



















_ l _ *^aia2 


>0\ 








1 










ai 








%aia2 


> 


CG 2 









1 







a2 






•^02 


^aia2 


> 


\CGl2 


J 




^ 1 


1 


1 


) 


aia 2 








3?ai<J2 





The facet F a2 is determined by vertices cg , and cg x 



6. Discussion 

In this paper we focus on the diagnosis models and their characteristic imsets. We have shown 
that the dimension of the characteristic imset polytope for the diagnosis models with fixed m 
and n is n ■ (2 m — 1), the explicit description of all edges of the polytope in terms of graphs, 
and the characteristic imset polytope for the diagnosis models with fixed m and n are the direct 
product of n many (2 m — 1) dimensional simplicies. In the following subsection we have listed 
several questions which we are working on. 

6.1. Future work. There are several thing we would like to work on the characteristic imset 
polytopes for diagnosis models P m ,n- First, we would like to find all facets of P m ,„ for any 
m,n > 1. Then in order to analyze sensitivity of the quality criterions and data, we would like 
to compute the normal fan of P m , n . Consider a vertex v of P m , n - A normal cone at v is a cone 
generated by the normal vectors of all facets which contain v. Note that the normal cone at v 
is the set of all cost functions c which give the vertex v as the optima solution for the linear 
programming problem, 

maxc • x such that x G P m ,n- 

The normal fan is the union of normal cones for all vertices of the polytope. It is also interesting 
to consider the mis-specification problem, that is, the behavior of optimal solution when the 
assumption of diagnosis model is wrong. 

Also we would like to study the characteristic imset polytope for trees with iV nodes. 

7. Acknowledgment 
The authors would like to thank M. Studeny and Bernd Sturmfels for their useful advise. 

References 

[1] H. Akaike. Information theory and an extension of the maximum likelihood principle. In B. Petrox & F. Caski 
(Eds.), Proceedings of the second international symposium on information theory, pages 267-281, 1973. 

[2] S. A. Andersson, D. Madigan, and M. D. Perlman. A characterization of markov equivalence classes for 
acyclic digraphs. Annals of Statistics, 25:505-541, 1997. 

[3] N. Friedman, M. Linial, I. Nachman, and D. Pe'er. Using bayesian networks to analyze expression data. 
Journal of Computational Biology, 7:601-620, 2000. 



THE CHARACTERISTIC IMSET POLYTOPE FOR DIAGNOSIS MODELS 



17 



[4] X. Jiang, R.E. Neapolitan, M.M. Barmada, and S. Visweswaran. Learning genetic epistasis using bayesian 

network scoring criteria. BMC Bioinformatics, 12(89), 2011. 
[5] V. Kaibel and M. Wolff. Simple 0/1-polytope. Europ. J. Combinatorics, 21:139-144, 2000. 
[6] S. L. Lauritzen. Graphical Models. Clarendon Press, 1996. 

[7] P.J.F. Lucas. Bayesian model-based diagnosis. International Journal of Approximate Reasoning, 27(2):99- 
119, 2001. 

[8] G. Schwarz. Estimating the dimension of a model. Annals of Statistics, 6:461-464, 1978. 

[9] M. A. Shwc, D. E. Hcckerman, M. Henrion, H. P. Lehmann, and G. F. Cooper. Probabilistic diagnosis using 

a reformulation of the internist- 1/qmr knowledge base: I. the probabilistic model and inference algorithms. 

Methods of Information in Medicine, 30:241-255, 1991. 
[10] M. Studeny. Probabilistic Conditional Independence Structures. Springer Verlag, 2005. 

[11] M. Studeny, R. Hemmecke, and S. Lindner. Characteristic imset: a simple algebraic representative of a 
bayesian network structure. In Proceedings of the 5th European Workshop on Probabilistic Graphical Models, 
pages 257-264, 2010. 

[12] M. Studeny, J. Vomlel, and R. Hemmecke. A geometric view on learning bayesian network structures. Inter- 
national Journal of Approximate Reasoning, 51(5):573-586, 2010. 

[13] J. Uebersax. Pgenetic counseling and cancer risk modeling: An application of bayes nets, marbella. Spain: 
Ravenpack International, 2004. 

[14] G. Ziegler. Lectures on Polytopes. Springer Verlag, New York, New York, 1994. 



